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TITLE OF THE INVENTION 

Method for Normalizing the Relative Intensities of Detection Signals in 
Hybridization Arrays 

5 

FIELD OF THE INVENTION 

The present invention relates to the field of hybridization arrays. More 
specifically, the present invention concerns a method for normalizing signals to 
10 be compared in hybridization arrays. This novel method relies on the use of 
ribosomal RNA (rRNA) as an internal standard and allows approximation of the 
relative abundance of multiple mRNAs as well as direct comparisons between 
any two specific RNA samples. 

15 BACKGROUND OF THE INVENTION 

In DNA microarray experiments, one of the more popular ways to control for 
spotted DNA quantity and surface chemistry anomalies involves the use of two- 
color fluorescence (see refs. 4, 5). For example, a Cy3 (green)-labelled probe 

20 prepared from healthy tissue could be used as a control to examine expression 
profiles of a Cy5 (red)-labelled probe prepared from a tumor tissue. The 
normalized expression values for every gene would then be calculated as the 
ratio of experimental expression to control expression. This method can 
obviously eliminate much (but not all) experimental variation by allowing two 

25 samples to be compared on the same chip because there is enough DNA on 
each spot that both test and reference cDNAs can hybridize to it at once without 
interference. More sophisticated three-color experiments are also possible in 
which one channel serves as a control for the amount of spotted DNA, and 
channels two and three allow two samples to be compared to this control and 

30 to each other (see ref. 5). 

In addition to the local normalization method described above, more general 
methods are also available in the form of control spots on the slide. With a set 
of control spots, it is possible to control variations in overall slide quality or 
35 scanning differences. 



Applicable normalization strategies are based on some underlying assumptions 
regarding the data and the strategies used for each experiment. These 



strategies must therefore be adjusted to reflect both the system under study 
and the experimental design. A primary assumption is that for either an entire 
collection of arrayed genes or some subset of the genes (such as 
housekeeping genes), or for some added set of controls, the ratio of measured 
5 expression averaged over the set should be close to unity. 

The need for good methods of normalisation for microarray data can not be 
overstated (see refs. 6, 7). Depending on the experimental design, there are 
three useful approaches for calculating normalization factors. The first simply 
0 relies on the total fluorescent intensity measured. The assumption underlying 
this approach is that the total mass of RNA labelled with either Cy3 or Cy5 is 
equal. While the intensity for any one spot may be higher in one channel than 
the other, when averaged over thousands of spots in a given array, these 
fluctuations average out. Consequently, the total integrated intensity across all 
5 the spots in the array should be equal for both channels. Alternatively, one 
could add a number of controls in increasing but equimolar concentrations to 
both labeling reactions, and the sum of the intensities for these spots should be 
equal. 

A second approach uses linear regression analysis. For closely related 
samples, one would expect many of the genes to be expressed at nearly 
constant levels. Consequently, a scatter plot of the measured Cy5 versus Cy3 
intensities should have a slope of one. Measured intensities for added 
equimolar controls should behave similarly. Under this assumption, one can 
use regression analysis techniques to calculate the slope which is used to 
rescale the data and adjust the slope to one. 

A third approach has been described by Chen et al (1997) (ref. 1). In it, it is 
assumed that a subset of housekeeping genes exists and that for these genes 
the distribution of transcription levels should have some mean value and 
standard deviation that are independent of any particular sample. In this case, 
the ratio of measured Cy5 to Cy3 ratios for these genes can be modeled and 
the mean of the ratio adjusted to 1 . Chen and his collaborators describe an 
iterative procedure to achieve this normalization. Quackenbush and 
collaborators (ref. 2) have implemented their own algorithm and a variation 
thereof that uses the entire data set in a data visualization and analysis tool 
called TIGR ArrayViewer. Other statistical methods of determining data 
accuracy have been described (ref. 3, 11). 



The above procedures describe array-based measures that can be used to 
normalize data. However, even with multiple colour fluorescence and control 
spots, undesired experimental variation can contaminate expression data. It is 
also possible that some or all of the physical normalization techniques are 
missing from the experiment, in which case it is even more important to find 
additional means of normalization. 



The use of internal standards overcomes these problems. Using an 
exogenously added standard has the advantage of giving the user absolute 

10 control over the amount of template added, with no variation between samples. 
Using an exogenous standard does not, however, control differences in the 
quality of the starting RNA in a reverse transcription reaction. If there are 
differences in the levels of integrity of the RNA between otherwise identical 
samples, the yield of specific reverse transcriptase products will reflect this 

15 variation, although the external standards will still appear identical. For this 
reason, as well as for simplicity and reproducibility, an endogenous RNA 
standard should be favoured in microarray experiments. 

Theoretically, an ideal endogenous standard for a DNA microarray would be a 
20 transcript whose expression does not vary during the cell cycle, between cell 
types, or in response to the experimental treatments that one wishes to 
examine. Additionally, for an endogenous standard to be valid in a microarray 
it is crucial that it be of a similar relative abundance as the test and reference 
(or target) transcripts in the microarray. Unfortunately, such a molecule does 
25 not exist and there are serious limitations to the standards currently in use. For 
example, although beta-actin is a frequently used standard (refs 9, 10), its level 
of expression varies significantly from tissue to tissue. 

For DNA microarray experiments, mRNA is copied into cDNA with the use of 
30 reverse transcriptase so that the relative abundance of individual mRNAs is 
reflected in the cDNA product. Input RNA in reverse transcription reactions is 
usually quantified by spectrophotometry. The RNA that is used in a typical pre- 
reverse transcription reaction is total RNA, 80% of which is ribosomal RNA. The 
mRNA component of total cellular RNA can vary from 2% to 5% depending on 
35 the tissue, the remainder of the RNA consisting of tRNA or small nuclear RNAs. 
Therefore, even if a transcript is invariant (as expressed as a percentage of 
mRNA), its relative abundance would still vary when considered as a percent 
of the total input RNA from different source tissues. 



4 

Since the majority of the RNA is rRNA, the level of rRNA remains essentially 
constant from sample to sample. Because 18S and 28S rRNA make up the 
majority of optically absorbent material at OD 26 onm, they should make ideal 
invariant controls. In fact, 18S and 28S transcripts are frequently used as 
5 internal controls in northern hybridization, RNAse protection and quantitative 
RT-PCR assays (see ref. 8). However, the overwhelming abundance of rRNA 
is a major limitation to its utility as a control in DNA microarray experiments. 

In US Patent 6,057,134, Ambion describes a method to perform RT-PCR™ 
10 which allows an invariant transcript of any relative abundance such as an 18S, 
28S, or 5S ribosomal RNA, actin, or glyceraldehyde 3-P phosphate 
dehydrogenase RNA to be used as a control for any other transcript. This 
allows two targets of vastly different abundance to be quantified simultaneously 
in a multiplex RT-PCR™ reaction. Ambion uses blocked primers, or 
15 Competimers™, that compete with the unmodified primers for binding to a DNA 
template but cannot be used as primers for extension by a DNA polymerase. 
Thus, at each extension step in PCR™, a percentage of template is unavailable 
for amplification. By increasing the ratio of Competimers™ to primers in a 
PCR™ reaction, the amplification efficiency of an amplicon can be reduced so 
20 that the linear phase of accumulation of PCR™ product matches that of a less 
abundant target in multiplex PCR™. 

For a control to be usable for microarray hybridization, the intensity of the signal 
should be in the same dynamic range as the cDNA under evaluation. rRNA- 
25 derived cDNA has never previously proved useful as a control for microarrays 
probably because it is thousands of times too abundant compared to specific 
cDNA. 

OBJECTS OF THE INVENTION 

30 

An object of the present invention is therefore to provide an improved method 
for providing an internal standard for normalizing the relative intensities of 
signals in hybridization arrays, an improved method for normalizing perse and 
a method of hybridizing making use of the improved normalization. 

35 



SUMMARY OF THE INVENTION 
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More specifically, in accordance with the present invention, there is provided an 
improved method for providing an internal standard for normalizing the relative 
intensities of signals in hybridization arrays that is based on the use of 
ribosomal RNA (rRNA) as this internal standard. Ribosomal RNA has been 
5 found to be particularly suitable for this purpose because its abundance, in 
terms of percentage of total RNA, does not vary through the cell cycle or with 
a particular treatment. 

The method of the present invention may be summarized as follows. On a 
10 given DNA microarray, for example, an oligonucleotide specifically recognizing 
a sequence contained in ribosomal RNA is spotted along with the other DNA 
probes used to analyze gene expression, as is usual with this technique. The 
spots therefore essentially consist of capture probes. Ribosomal RNA, being 
of relatively invariant quantity in terms of percentage relative to total RNA 
1 5 provides a stable quantitative control to evaluate the quantity of other types of 
RNA. However, since it is also found in massive amounts relative to other 
RNAs, its level of detection by the technique must be toned down while 
remaining accurate. To that end, an experimentally-defined quantity of 
oligonucleotides carrying the same sequence as that of the oligonucleotide 
20 capture probe found on a spot of the microarray is added to the hybridization 
mixture so that the excess signal coming from the labelled rRNA (or from the 
cDNA generated from the rRNA, if cDNA hybridization is the method selected) 
is competed out and the signal detected for it is reduced to a range compatible 
with that of the signal for the other labelled RNAs. 

25 

Specifically, the present invention provides a novel method for providing an 
internal standard for normalizing the relative intensities of signals on a 
hybridization array, comprising: 

30 

adding a known quantity of an unlabelled ribosomal nucleic 
acid competitor probe into a hybridization buffer suitable for 
the array experiment, the competitor probe characterized in 
that it has the same sequence as at least portion of a capture 
35 probe present in the array for immobilizing ribosomal nucleic 

acids thereon; and 
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allowing the competitor probe to compete with a ribosomal 
capture probe for hybridization to a suitably labelled rRNA- 
derived cDNA of a cDNA sample, such that a hybridization 
signal of labelled rRNA-derived cDNA is decreased to a 
5 suitable signal dynamic range of detection and the rRNA- 

derived cDNA of the sample becomes a suitable internal 
standard for the hybridization array. 

The method of the present invention may further include: 

10 

determinating the quantity of hybridized rRNA-derived cDNA; 
and 

comparing the quantity of hybridized rRNA-derived cDNA 
15 against standard curves to determine the quantity of cDNA in 

said sample. 

The present invention further provides a normalization method, wherein the 
above steps for obtaining an internal standard are reproduced for a test sample 
20 using a first label, and for a suitably-labelled reference sample using a second 
label, and the quantity of hybridized rRNA-derived cDNA originating from the 
test sample is compared to the quantity of rRNA-derived cDNA originating from 
the reference sample hybridizing to the same capture probe to provide a 
normalization factor. 

25 

The present invention further provides a hybridization array, wherein the above 
steps for normalizing are reiterated and the normalization factor is used to 
correct a hybridization signal provided by the binding of a target cDNA of the 
test sample labelled with the first label to a capture probe specific to said target, 
30 which correction makes said hybridization signal directly comparable to a 
hybridization signal provided by the binding of the same target of the reference 
sample labelled with the second label to the same capture probe specific to 
said target. 

35 In a preferred embodiment, the rRNA competitor probe is present in a 
concentration that is about 5 to about 100 times that of the capture probe. 
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The rRNA-derived cDNA may be labelled by any suitable means, such as by 
3' addition of phosphate, or labelling with cyanines, biotin, digoxygenin, 
fluorescein, a dideoxynucleotide, an amine, a thiol, an azo (N 3 ) group or 
fluorine, or any other form of label. 

5 

An array comprising a plurality of spotted cDNA capture probes for binding 
ribosomal nucleics, alone or in combination with the competitor ribosomal probe 
in a separate component are further objects of this invention. The method of the 
present invention is suitable for use in high-throughput screening experiments. 

10 

It may be used for any type of array experiment, including but not limited to the 
identification of sequences found in the open reading frame of genes coding for 
transcription factors, such as c-Rel, E2F-1, Egr-1, ER, NFkB^rSO, p53, Sp1 and 
YY1. ^ 

15 

Other objects, advantages and features of the present invention will become 
more apparent upon reading of the following non restrictive description of 
preferred embodiments thereof, given by way of example only with reference 
to the accompanying drawings. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

In the appended drawings: 

25 

Figure 1 : A summary view of the described technology. Any given pool of total 
cellular RNA is usually composed of 80% ribosomal RNA (rRNA) and 20% 
messenger RNA (mRNA) and small nuclear RNAs. mRNA (except for the 
histone genes) is polyadenylated while rRNA never is. Making cDNA from both 

30 types of RNA by reverse transcription is possible if using a poly dT primer for 
mRNA (producing mRNA-derived cDNA, shown by solid arrows) and a specific 
primer for rRNA (producing rRNA-derived cDNA, shown by dashed arrows). 
Analysis of mRNA by microarray using the constant rRNA as a standard is 
made difficult by the relative overabundance of rRNA relative to mRNA; this 

35 problem is circumvented by adding to the hybridization mix a rRNA competitor 
probe which has the same sequence as the microarray's rRNA-cDNA capture 
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probe (both shown as lines marked with an V). By sequestering the excess 
rRNA-derived cDNA, the competitor probe brings down the level of hybridizable 
and hybridized rRNA-derived cDNA to usable levels. 

5 Figure 2: Human ribosomal DNA complete repeating unit (GB accession 
number #U13360). ETS: externally transcribed spacer. ITS: internally 
transcribed spacer. IGS: intergenic spacer. The position of a few rRNA probes 
is shown. 

10 Figure 3: Illustration of spotted DNA capture probes on the slide. The slide 
used for the described experiment carries 12 probe blocks, identified 1 to 12. 
In each block there are 7 rows and 16 columns of spots. Each DNA capture 
probe was spotted in duplicate in an adjacent column (i.e., all odd columns 
correspond to a duplicate column) so there are 8 different DNA probes in a 

15 column. There are a total of 1344 spots on the slide, corresponding to 
duplicates of 463 different DNA capture probes and 209 negative controls (no 
DNA probe). 

Figure 4: Cohybridization of labelled cDNA from Jurkat (reference sample: 
20 Cy3-green) and Jurkat-TPA (test sample: Cy5-red). Ratio images exported from 
GenePix Pro 3.0 (Axon Instruments Inc.) as JPEG (or TIFF) files are 24-bit 
RGB color. 

Figure 5: Cohybridization of labelled cDNA from Jurkat (Cy3-green) and Jurkat- 
25 TPA (Cy5-red). Five (5) ng of rRNA competitor probe 2 was added to the 
hybridization mix to compete for the hybridization of the rRNA-derived cDNA to 
the attached rRNA cDNA capture probe 2. Ratio images exported from 
GenePix Pro 3.0 (Axon Instruments Inc.) as JPEG (or TIFF) files are 24-bit 
RGB color. 

30 

Figure 6: Cohybridization of labelled cDNA from Jurkat (Cy3-green) and Jurkat- 
TPA (Cy5-red). Fifty (50) ng of rRNA competitor probe 2 was added to the 
hybridization mix to compete for the hybridization of the rRNA-derived cDNA to 
the attached rRNA cDNA capture probe 2 (which has the same sequence as 
35 rRNA competitor probe 2). Ratio images exported from GenePix Pro 3.0 (Axon 
Instruments Inc.) as JPEG (or TIFF) files are 24-bit RGB color. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
GLOSSARY 

5 In order to provide a clear and consistent understanding of terms used in the 
present description, a number of definitions are herein provided. 

Array : In the context of this invention, an array is a set of different spotted 
DNA consisting of capture probes for target nucleic acids. Such an array is 
10 described in US Patent No. 5,700,637. 

Complementary DNA (cDNA) : DNA that has been synthesized from RNA by 
the effect of the enzyme reverse transcriptase, converting RNA bases into their 
complements (A to T, U to A, G to C, C to G). 

15 

Cy3, Cy5 : Non-radioactive fluorescent dyes from Amersham Pharmacia 
Biotech that are widely used for labeling DNA in microarray experiments. 

Feature : A feature is a spot (typically of DNA) on a slide. The collection of such 
20 features is called a microarray. 

Hydridization : The process of joining two complementary strands of DNA, or 
one strand each of DNA and RNA, to form a double-stranded molecule. 

25 Messenger RNA (mRNA) : RNA that is used to direct the protein synthesis that 
is part of gene expression. It represents but a small fraction of the total RNA 
found in a cell. 

mRNA-derived cDNA : cDNA synthesized from a mRNA template using 
30 reverse transcriptase and a mRNA-specific primer. 

Microarray-sequestered DNA or DNA capture probe: DNA (single-stranded 
or double-stranded) that are anchored onto the solid surface of a microarray. 
(See fuller description of microarrays immediately following this Glossary.) 



35 



Oligonucleotide : A short strand of single-stranded DNA, typically composed 
of up to 50 bases. 
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10 

Pixel Intensity : The raw intensity of a pixel on a GenePix (Axon Instrument 
Inc.) single-wavelength or ratio image, falling in a range from 0 to 65535. 

PMT : Photomultiplier tubes in scanners used to analyze array images. These 
5 array images are the end products of comparative hybridization experiments. 

Ratio Image : The ratio image is an RGB (Red-Green-Blue) overlay image. In 
this image, wavelength #1 (635 nm) is mapped to the green channel of the 
RGB image, and wavelength #2 (532 nm) is mapped to the red channel. 
10 Superimposing these two images onto each other results in a third, composite 
image, whose color is a blend of the red and green signals. 

Ratio of medians : The ratio of medians is the ratio of the background 
subtracted median pixel intensity at the second wavelength to the background 
15 subtracted median pixel intensity at the first wavelength. 

Reference cDNA : this cDNA originates from a reference sample that is used 
for comparison with another one, called test cDNA obtained from a test sample. 
The reference cDNA serves as a control against which test cDNAs may be 
20 compared to quantify changes in the level of expression of any mRNA found in 
the test sample. Typically, the reference cDNA is labelled with Cy3-dCTP 
(green fluorescent label) when a fluorescent label is used. 



RGB: Red-Green-Blue color. 

25 

Ribosomal RNA (rRNA): structural RNA found in the ribosomes. It is the most 
abundant form of RNA in the cell and does not vary significantly. 

rRNA-cDNA probe: a probe which is designed to hybridize to the rRNA-derived 
30 cDNA found in the hybridization mixture. This probe may be the capture probe, 
which may have the same sequence as the rRNA competitor probe (see below) 
so as to compete with it for the target rRNA-derived cDNA. 



rRNA competitor probe: a DNA oligonucleotide with the same sequence as 
35 part of a ribosomal RNA-cDNA sequence and capable of competing with the 
microarray capture probe for hybridization with a rRNA-derived cDNA. This 
oligonucleotide has the role of competing for the limited space available on the 
rRNA cDNA capture probe bound to the microarray, thus reducing the quantity 
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of rRNA-derived cDNA which can be retained on the microarray and thus 
allowing the use of rRNA-derived cDNA as an « internal standard ». 

rRNA-derived cDNA: cDNA synthesized from a rRNA template using reverse 
5 transcriptase and a rRNA-specific primer. 

Saturation : Saturation refers to the overloading of the photodetection circuitry. 
Saturation can be reduced by reducing the amount of light that is reaching the 
PMTs, which is done by reducing the amount of incident laser light. In practice, 
10 this is accomplished by reducing the voltage of the PMT, which reduces its 
gain. Saturating pixels in GenePix 1.0 are shown as white pixels in the raw 
wavelength images. 

Spotted DNA : Known DNA capture probe that is spotted onto a microarray 
15 slide and used to identify the nucleic acids present in unknown samples (test 
and reference). The spotted DNA could be oligonucleotide or cDNA. 

Test cDNA : cDNA from a cell sample that is to be tested, in comparison with 
a reference sample. Typically, the test cDNA is labelled with Cy5-dCTP (red 
20 fluorescent label) when a fluorescent label is used. 

Microarrays are made from a collection of purified DNAs. A drop of each type 
of DNA in solution is placed onto a specially-prepared glass microscope slide 
by an arraying machine. The arraying machine can quickly produce a regular 
grid of thousands of spots in a square about 2 cm on a side, small enough to 

25 fit under a standard slide cover slip. The DNA in the spots is bound to the glass 
to keep it from washing off during the hybridization reaction. The choice of DNA 
to be used within the spots on a microarray' s surface determines which genes 
can be detected in a comparative hybridization assay. These DNA probes could 
be synthetic oligonucleotides or PCR amplified DNA (hence the terms "oligo 

30 microarray" and "cDNA microarray"). 

The invention relates to rRNA used as an internal standard for the 
normalization of the fluorescence intensities in microarray analysis experiments. 
This can provide an estimate of relative abundance of multiple mRNAs and 
allow direct comparison between two RNA samples. 
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Use of rRNA for normalization provides a sound method of identifying 
differentially expressed genes between two samples because its percentage of 
abundance in total RNA does not vary through the cell cycle or with a particular 
treatment. 

5 

In order to detect the difference in gene expression between two samples on 
a single microarray slide, the RNA should be reverse transcribed to cDNA and 
labelled with two different fluorophores prior to cohybridizing both samples to 
the same slide and same spots simultaneously. There are several techniques 

10 that allow labeling of cDNA. Direct labeling is done by the incorporation of a 
fluorescent nucleotide such as, for example, Cy3-dCTP (green) or Cy5-dCTP 
(red) (from Amersham-Pharmacia Biotech), during the reverse transcription 
reaction. Other protocols may be used for labeling the cDNA following the 
reverse transcription reaction (indirect labeling). Alternatively, the cDNA can be 

15 used for RNA amplification involving 17 polymerase. This method relies on 
attaching a T7 promoter sequence to the reverse transcriptase primer used for 
synthesis of the first cDNA strand. After second strand cDNA synthesis, one 
can generate amplified RNA (aRNA) using T7 RNA polymerase and the double- 
stranded cDNA molecules as targets for the linear amplification. Those targets 

20 can then be labelled directly or indirectly. 

In the present invention, the reverse transcriptase reaction for the cDNA 
labeling step involves the use of two kinds of reverse transcriptase primers in 
the same reaction: an oligo-dT and specific primers for rRNA (5.8S, 18S or 28S 
25 rRNA). One set of RNA to be reverse transcribed is all the polyA+ mRNA that 
is present in the RNA sample, the other set is the rRNA. Both sets are labelled 
in the same sample with the same label. Random short primer like random 
hexamers or sets of specific primers could also be used as alternative methods 
to reverse transcribe all the polyA+ mRNA. 

30 

In a typical experiment, the reference cDNA is labelled with Cy3 and the test 
cDNA is prepared in the presence of Cy5. Both of these cDNA populations are 
hybridized to the same spotted DNA capture probes on the microscope slide. 
After the hybridization and washing steps, the slide is scanned at the 
35 appropriate wavelengths and an image is generated for each wavelength. In the 
derived ratio image, a red spot indicates that the test cDNA for this feature is 
more abundant than the reference cDNA which means that the test cDNA is 
being expressed at a level higher than the reference cDNA; a yellow spot 
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means that there is no change, in the expression level between the two 
populations of test and reference cDNA. In order to measure changes in gene 
expression numerically, image analysis software like GenePix 1.0 (Axon 
Instruments, Inc.) extracts the intensity of a given feature (spot) from an image 
5 and performs a number of computations on the raw data. In this kind of 
comparative analysis, normalization is essential to compensate for variations 
in RNA isolation techniques, initial quantification errors, tube to tube variation 
in reverse transcriptase reactions and other experimental variations. That is 
where the present invention intervenes : normalization is possible upon 
10 correcting the green intensity and the red intensity of the spot having the 
internal standard capture probe to achieve a ratio of 1. This normalization 
therefore leads to the obtention of a correction factor that is applied to the 
intensities of signals specific to each reference and test samples. 

15 The end product of a comparative hybridization experiment is a scanned array 
image. Saturated pixels appear when there are more photons detected than 
can be processed by the photomultiplier tubes (PMT) of the scanner. This 
occurs when the amount of hybridized target per shot is too high. Saturated 
pixels cannot be used for proper measurement of the signal intensity. PMT 

20 should then be set to avoid the detection of saturated pixels. As a 
consequence, this reduces the signal intensity of all other spots and low levels 
of cDNA will not be detected. 

In the present invention, the hybridization step is performed with specific 
25 amounts of free rRNA-derived cDNA (competitor probe) added into the 
hybridization buffer so as to set up a competition for ribosomal cDNA of the test 
cDNA and of the reference cDNA (if the latter is part of the experiment) with the 
capture probe. For efficient competition, the competition probe should be 
nearly identical to the capture probe or have a high level of overlapping 
30 sequences therewith. The hybridization efficiency of the rRNA-derived cDNA 
with the capture probe can be predictably and reproducibly altered. Reducing 
the hybridization of these internal and abundant targets in microarray 
experiments has the effect of generating a signal intensity in the same dynamic 
range of detection as the less abundant targets in microarrays. 

35 

The competition is important because the control must be detected at a level 
similar to the test transcript. If one target is present at a significantly higher 
concentration than the other, the PMT (laser voltage) has to be reduced to 
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avoid a saturated signal, with the consequence of reducing all the other signals. 
The ability to obtain quantitative information for low abundant mRNA will then 
be lost. 

5 With the applicants 7 invention, the normalization factor is computed using the 
ratio of intensity obtained between the signal detected for the test cDNA and 
that of the reference cDNA. This ratio should be 1.0. For example, if the ratio 
is 0.8, a normalization factor of 1.25 would have to be calculated (1/0.8). The 
analyzed data is then corrected using this factor. If the normalization factor is 
10 greater than 2 (or less than 0.5) the slide is usually rescanned with other PMT 
voltage to ensure maximum data integrity. 

RESULTS 

15 The applicants used the products and protocols that are described herein, 
which results in proper normalization. 

Figure 1 illustrates how a given sample (reference or test) is labelled and 
hybridized to capture probes (a plurality of specific cDNA probed spots and one 
20 internal standard probe spot). The labelled ribosomal cDNA is mixed with a 
competitor probe that is here identical to the capture probe. 

Figure 2 illustrates the organization of the rDNA locus. The microarray was 
made from a collection of synthetic DNA oligonucleotides as DNA probes. 

25 

Figure 3 illustrates the positions of spotted DNA capture probes on the slide. 
In order to use the cDNA made from rRNA for normalisation, a DNA capture 
probe having a sequence that is complementary to the rRNA-derived cDNA has 
also been spotted on the array slide. 

30 

Table 1 shows the sequences of two DNA probes designed for that purpose. 
3D-Link Activated slides from Surmodics Inc. were used according to the 
supplier's protocol for the covalent attachment of the 5' amino modified 
oligonucleotides and prehybridization treatment of the slides. On the DNA 
35 microarray used here, each spot contains approximately 0.1 5ng of bound DNA 
probe. 
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The cDNA for microarray analysis was prepared from RNA templates by 
incorporation of fluorescent-labelled deoxyribonucleotides during first strand 
cDNA synthesis. 10p,g of total RNA extract from Jurkat and Jurkat-TPA cell 
lines (Geneka Biotechnology) was used. Priming of cDNA synthesis was 
5 performed using 2^g of oligo (dT). For each labeling reaction, 50 ng of 18S 
primer were included to allow reverse transcription of the 18S rRNA. Table 1 
shows the sequences of the 18S reverse transcriptase primer. In this 
experiment, labelled reference cDNA from Jurkat total RNA was prepared using 
Cy3-dCTP while Jurkat-TPA total RNA was reverse transcribed and labelled 
10 using Cy5-dCTP (Amersham Pharmacia Biotech) to produce labelled test 
cDNA. Reverse transcriptase reactions were performed using the Superscript 
II reverse transcriptase (LifeTechnologies) enzyme according to the supplier's 
protocol. 

15 For the hybridization and washing steps the following conditions were used 
(optimized conditions for 3D-Link Activated slides, Surmodics Inc.). Labelled 
cDNAs were cohybridized in 5x SSC-0.1% SDS buffer for 16 hours at 45°C. 
Washing was performed by incubating slides two times 15 minutes in 2x SSC- 
0.1% SDS at 45°C, one time 5 minutes in 0.2x SSC at room temperature and 

20 one time 5 minutes in 0.1 x SSC at room temperature. Slides were dried by low 
speed centrifugation. 

The test and reference cDNAs were analyzed through hybridization with the 
microarray-sequestered cDNA. In this type of experiment, if the test or 
25 reference cDNA contains a sequence that is complementary to the DNA on a 
given spot, that cDNA will hybridize to the spot, where it will be detectable by 
virtue of its fluorescence. 

Figure 4 shows a ratio image of a typical cohybridized cDNA with no internal 
30 standard according to the invention. The target cDNAs and the results are 
listed in Table 2 (see right column). Figures 5 and 6 show counterparts of 
arrays of Figure 4 but with 5 ng and 50 ng of ribosomal competitor probe, 
respectively, in accordance with this invention. The results are listed in Table 
2, in the middle and left columns, respectively. 
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Saturated spots were observed for the two rRNA cDNA probes (DNA probe 1 
and probe 2). The GenePix 3.0 software (Axon Instruments Inc.) was used to 
extract the intensity of each feature (hybridized spot) from the image. Table 2 
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shows the mean value of pixel intensity for each spot. To analyse feature 
intensity and calculate a ratio, the local background should be subtracted from 
the median value of the pixel. The method used by GenePix Pro 3.0 for 
determining the background intensity is a local background subtraction 
5 technique. A different background is therefore computed for each individual 
feature-indicator and the median value of the background pixel intensities are 
reported (Table 2). 

The end product of a comparative hybridization experiment is a scanned array 
10 image. Saturated pixels appear when there are more photons detected than the 
photomultiplier tubes (PMT) of the scanner can process. This occurs when the 
amount of hybridized cDNA to the spot is too high. Saturated pixels cannot be 
used for proper meaurement of the signal intensity. PMT should then be set to 
avoid the detection of saturated pixels. As a consequence, this reduces the 
1 5 signal intensity of all other spots, and lower levels of cDNA will not be detected. 

Because of the high abundance of the rRNA-derived cDNA relatively to the 
mRNA-derived cDNA, it is important to reduce its hybridization to the 
microarray-sequestered DNA. In this invention, the applicants compete the 

20 hybridization of the rRNA-derived cDNA to the microarray DNA capture probe 
by adding a defined amount of rRNA competitor probe in the hybridization 
buffer, said probe carrying the same sequence as the microarray-bound probe. 
Five (5) to 100 molar excess of competitor probe relative to the quantity of 
microarray DNA capture probe is enough to obtain a rRNA-derived cDNA signal 

25 intensity in the same dynamic range of detection as the other cDNAs (i.e., test 
and/or reference mRNA-derived cDNA), which are otherwise present in much 
lesser quantities in the reaction buffer. The amount of molar excess to be used 
is essentially a function of the amount of the total RNA used for the assay (for 
example : 0.2 to 20 pg). 

30 

In short, because of their relatively invariant expression across tissues and 
treatments, 18S and 28S RNA are ideal internal controls for quantitative RNA 
analysis by microarrays. The current invention describes how to use these 
rRNAs to that end by compensating, thanks to competition with specific oligos, 
35 for their overabundance relative to the mRNA of test and reference cell 
samples. 
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The overall exhaustive results of comparison of test and reference cDNAs, 
normalized in accordance with the method and principles of the present 
invention, are provided in appendix 1. 

Although the present invention has been described hereinabove by way of 
preferred embodiments thereof, it can be modified, without departing from the 
spirit and nature of the subject invention, as defined in the appended claims. 
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Appendix 1: Signal normalization using 18S RNA as an internal standard. Two 
microarray analyses were performed independently, each one comparing the 
expression of many transcription factors in Jurkat cells and in Jurkat cells 
treated with the phorbol ester TPA. The signals obtained in the latter case were 
5 divided by the signals obtained in the former case to get a ratio of induction by 
TPA in these cells. The signals were normalized using 18S RNA as a standard 
(see columns 3 and 4). Since 18S RNA is used as a control in both experiments 
and that the same type of cells were used, presumably giving very similar 
results, the ratio of the results obtained in each experiment should be nearing 
10 1 . That ratio is presented in column 5. 



Column 1 


Column 2 


Column 3 


Column 4 


Column 5 


Gene name 


Accession 


Jurkat/Jurkat 


Jurkat/Jurkat TPA Ratio of 






TPA 








number 


ratio 


ratio 


experiments 






experiment 1 


experiment 2 


1 and 2 



9G8 splicing factor 


L22253 


0.84 


1.00 


0.836078512 


9G8 splicing factor 


L22253 


0.77 


0.99 


0.779340183 


A-Myb 


X66087 


1.32 


1.38 


0.950679679 


A-Myb 


X66087 


1.34 


1.43 


0.937305665 


A-Myb 


X13294 


1.12 


1.21 


0.924150275 


A-Myb 


X13294 


1.12 


1.21 


0.924083463 


ABF-1 


AF060154 


0.45 


0.39 


1.166895465 


ABF-1 


AF060154 


0.39 


0.38 


1.029207795 


ABH 


NMJD06020 


0.91 


1.05 


0.865303363 


ABH 


NM_006020 


0.81 


0.98 


0.822950019 


ABP/ZF 


U82613 
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AF10 


NM_004641 
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1.31 
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1.23 
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